Models and Issues in Consistent Biclustering

نویسندگان

  • O. Erhun Kundakcioglu
  • Artyom Nahapetyan
  • Stanislav Busygin
  • Panos M. Pardalos
چکیده

Biclustering is a methodology allowing simultaneous partitioning of a set of samples and their features into classes. Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. The notion of consistency for biclustering is defined using interrelation between centroids of sample and feature classes. Consistent biclustering also implies separability of the classes by convex cones (see [Busygin et al. (2005)]). Previous works on biclustering concentrated on unsupervised learning and did not consider employing a training set, whose classification is given. However, with the introduction of consistent biclustering, significant progress has been made in supervised learning as well. A dataset (e.g., from microarray experiments) is normally given as a rectangular m× n matrix A, where each column represents a data sample (e.g., patient) and each row represents a feature (e.g., gene)

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending the definition of beta-consistent biclustering for feature selection

Consistent biclusterings of sets of data are useful for solving feature selection and classification problems. The problem of finding a consistent biclustering can be formulated as a combinatorial optimization problem, and it can be solved by the employment of a recently proposed VNS-based heuristic. In this context, the concept of β-consistent biclustering has been introduced for dealing with ...

متن کامل

Biclustering Gene Expressions Using Factor Graphs and the Max-Sum Algorithm

Biclustering is an intrinsically challenging and highly complex problem, particularly studied in the biology field, where the goal is to simultaneously cluster genes and samples of an expression data matrix. In this paper we present a novel approach to gene expression biclustering by providing a binary Factor Graph formulation to such problem. In more detail, we reformulate biclustering as a se...

متن کامل

The Effect of Monetary Policy on Business Cycles in Iran Economy

Nowadays one of the most important issues in our economy, both from economic and political view is the link between monetary policy and business cycle fluctuations. Amongst the shocks related to the supply side, the shock of oil price is the important factor that has affected the world economy since the 1970s. This paper examines the effects of monetary policy and oil price shocks on the busine...

متن کامل

Finding checkerboard patterns via fractional 0-1 programming

Biclustering is a simultaneous partitioning of the set of samples and the set of their attributes (features) into subsets (clusters). Samples and features clustered together are supposed to have a high relevance to each other. In this paper we provide a new mathematical programming formulation for unsupervised biclustering. The proposed model involves the solution of a fractional 0-1 programmin...

متن کامل

Multiple Structure Recovery via Probabilistic Biclustering

Multiple Structure Recovery (MSR) represents an important and challenging problem in the field of Computer Vision and Pattern Recognition. Recent approaches to MSR advocate the use of clustering techniques. In this paper we propose an alternative method which investigates the usage of biclustering in MSR scenario. The main idea behind the use of biclustering approaches to MSR is to isolate subs...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007